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ABSTRACT 

This  statistical  study  assesses  the  change  of  Alberta  climate  using  the  monthly  data  of  the 
following  climate  variables:  1)  surface  air  maximum  temperature,  2)  surface  air  minimum 
temperature,  3)  total  precipitation,  4)  total  rainfall,  and  5)  total  snowfall.  The  study  period  is 
1884  -  1996  using  data  from  38  long-term  climate  stations  in  Alberta.  The  weighted  average  of 
individual  station  values  was  used  to  obtain  an  Alberta  average  of  climate  variables  for  each 
month.  Both  uniform  weights  (for  the  arithmetic  mean  or  uniform  average)  and  optimal  weights 
(for  the  optimal  average)  were  used.  The  optimal  average,  in  contrast  with  the  uniform  average, 
is  a  more  appropriate  integral  measure  for  a  climate  data  field.  The  Alberta  averages  were 
smoothed  using  an  eleven-year  running  mean  to  reveal  the  pattern  of  variation  in  time.  Based  on 
the  running  means  of  optimal  averages,  it  was  observed  that  the  maximum  temperature  does  not 
have  an  upward  trend,  whereas  the  minimum  temperature  has  a  clear  upward  trend  of 
approximately  0.8  degree  C  since  about  1920.  Total  annual  precipitation  appears  to  increase 
slightly  since  the  1920s.  However,  the  data  are  much  more  variable  and  statistical  testing  is  need 
to  confirm  the  results. 
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INTRODUCTION 

Studies  have  shown  that  global  average  temperature  has  increased  by  as  much  as  0.6  degree  C 
over  the  last  130  years  (IPCC,  1996).  The  trends  of  temperature  and  other  climate  variables  over 
smaller  areas  are  known  to  vary  greatly  from  place  to  place.  The  purpose  of  the  present  study  is 
to  examine  the  trends  of  climate  variables,  such  as  temperature  and  precipitation,  over  the 
province  of  Alberta  and  apply  an  averaging  method  that  would  provide  a  better  representation  of 
Alberta's  climatic  trends. 

The  spatial  averages  of  climate  variables  are  commonly  used  parameters  to  represent  the  state  of 
the  climate  over  an  area.  A  common  use  of  these  averages  is  in  the  estimation  of  climatic  trends. 
The  simplest  approach  is  to  use  the  arithmetic  mean  over  the  stations.  In  the  present  study,  an 
optimal  averaging  method  is  applied  over  Alberta,  incorporating  the  variance-covariance 
structure  among  long-term  Alberta  climate  stations.  Climatic  trends  were  then  computed  using 
this  optimal  average  to  provide  an  integral  measure  of  climatic  variation  over  the  province.  The 
optimal  average  incorporates  the  effect  of  the  distribution  of  climate  stations  and  considers  the 
dependence  of  data  from  adjacent  sites. 

For  simplicity  of  presentation,  the  numerous  graphs  and  tables  are  numbered  and  included  in 
Appendix  A.  Only  the  more  important  ones  are  labelled  alphabetically  and  contained  in  the  main 
text. 

CLIMATE  DATA 

In  the  present  study,  38  long-term  stations  in  Alberta  were  selected  for  analysis.  These  are  listed 
in  Table  1.  (See  Appendix  A.)  The  selection  represents  a  reasonable  cross-section  of  stations 
over  Alberta.  The  regions  represented  include  the  North  Saskatchewan  River  Basin  (NSRB),  the 
Red  Deer  River  Basin  (NDRB),  the  South  Saskatchewan  River  Basin  (SSRB),  the  Milk  River 
Basin  (MRB),  the  Rocky  Mountains  (ROCK),  the  Athabasca  River  Basin  (ARB)  and  the  Peace 
River  Basin  (PRB).  The  locations  of  the  selected  climate  stations  are  shown  in  Figure  1. 
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The  study  period  is  1884  to  1996  and  monthly  values  were  used.  Of  the  38  stations,  the  actual 
number  of  stations  with  data  available  varies  over  the  study  period.  In  general,  the  number  of 
records  increases  with  time  from  1884  to  the  late  1980s  and  then  decreases  after  1990.  This 
pattern  is  shown  in  Figures  2  to  3  for  temperature  and  precipitation. 

STATISTICS  OF  ALBERTA  CLIMATE 

Averages  of  Reference  Period  (1961-1990) 

In  the  present  study,  only  precipitation  and  temperature  data  were  analyzed.  The  temperature 
variables  are  maximum,  minimum  temperatures  and  the  precipitation  variables  are  rainfall, 
snowfall  and  total  precipitation  observed.  Total  precipitation  includes  snowfall  measurements 
converted  to  water  equivalent.  The  climatic  normals  of  temperature  and  precipitation  are  usually 
represented  by  the  long-term  averages  computed  over  a  period  of  30  years,  as  adopted  by  the 
World  Meteorological  Organisation  (WMO)  and  its  member  countries  (IPCC,  1996).  The  long- 
term  averages,  or  normals,  for  these  variables  at  each  of  the  selected  stations  were  computed  for 
each  month  using  the  30  years  of  data  from  1961  to  1990.  This  is  also  the  official  reference 
period  used  in  the  latest  IPCC  report  (IPCC,  1996,  p.  141).  These  averages  were  used  here  as  the 
measure  of  central  tendency  from  which  climatic  variations  were  assessed. 

The  monthly  averages  of  daily  maximum  temperature  and  daily  minimum  temperature  over  the 
reference  period  for  the  individual  stations  are  listed  in  Tables  2  and  3.  In  addition  to  the 
latitudinal  variation  of  maximum  temperature,  one  can  also  see  a  smaller  range  of  maximum 
temperature  over  the  province  in  summer  than  in  winter.  For  example,  the  largest  difference 
(17.9  °C)  in  January  is  between  Cardston  (-0.3  °C)  and  Vermilion  (-18.2  °C)  whereas  the  largest 
difference  (6.9  °C)  in  July  is  between  Medicine  Hat  (27.3  ""C)  and  Lake  Louise  (20.4 ''C).  The 
same  seasonal  variation  also  exists  in  the  range  of  minimum  temperature  over  the  province.  In 
this  case,  the  January  range  is  15  °C  between  Cardston  (-12,8  °C)  and  Fort  Vermilion  (-27.3  °C) 
and  the  July  range  is  8  °C  between  Medicine  Hat  and  Lake  Louise. 

The  precipitation  data  demonstrate  high  spatial  and  temporal  variation.  This  is  expected  within 
the  Province  of  Alberta  with  its  diverse  terrain  and  latitudinal  extent.  Annual  rainfall  of  Rocky 
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Mountain  House  is  about  1.5  times  higher  than  that  at  Brooks  and  annual  snowfall  at  Lake 
Louise  is  approximately  3  times  higher  than  at  Gleichen.  The  spatial  and  temporal  variations  of 
total  precipitation  are  shown  in  Table  6. 

The  averages  of  temperatures  and  precipitation  for  Alberta  over  the  reference  period  are  shown 
in  Figures  A  and  B.  The  individual  station  averages  vary  to  different  degrees  from  these  Alberta 
averages.  A  major  consideration  in  the  present  study  is  to  incorporate  individual  station 
variances  and  between  station  correlations  of  the  climate  variables,  when  computing  the  overall 
weighted  means  through  the  use  of  optimal  weights.  It  will  be  shown  that  significant  differences 
exist  between  the  optimal  average  and  the  uniform  average. 

Variances 

The  variances  of  climatological  data  may  not  be  entirely  due  to  the  natural  climatic  variation. 
Measurement  and  other  errors  also  contribute  to  the  variance  of  the  observations.  However,  it  is 
expected  that  the  contribution  of  such  errors  is  small  relative  to  that  from  natural  variability. 
Variances  must  be  considered  in  computing  the  overall  average  of  a  climate  variable  for  the 
province. 

In  the  present  study,  the  variance  of  the  data  is  represented  by  the  mean  squared  departure  from 
climatology.  The  monthly  anomalies  were  computed  by  subtracting  the  climatic  normals 
(reference  period  averages)  from  the  values  observed.  The  sum  of  squares  of  the  anomalies 
divided  by  the  total  number  of  records  was  used  as  the  variance  of  the  climate  variable.   This  is 
slightly  different  from  the  usual  statistical  definition  of  variance,  but  is  more  meaningful  in  the 
present  context  because  variability  is  expressed  in  terms  of  departures  (anomalies)  from  the 
reference  period  climatological  mean.  The  results  are  shown  in  Tables  7  through  1 1  for  the 
temperature  and  precipitation  variables. 

The  Alberta  Average  of  Climate  Variables 

For  an  integral  measure  of  a  climate  variable  over  a  region  such  as  the  province  of  Alberta,  it  is 
necessary  to  convert  the  point  measures  of  station  observations  to  an  areal  measure  for  Alberta  as 
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a  whole.  In  this  study,  a  uniform  average  (arithmetic  mean)  and  an  optimal  average  were  used. 
Both  these  averages  can  be  represented  as  weighted  averages  over  the  available  stations.  The 
uniform  average  is  a  common  measure  that  is  easy  to  compute  and  has  certain  intuitive  appeal. 
The  optimal  average,  however,  is  more  involved  computationally,  but  with  weights  reflecting  the 
variability  and  correlation  structure  of  the  data  field.  Specifically,  the  weights  were  computed 
using  the  variance-covariance  matrix  of  the  climate  variable  observed  over  the  stations. 

The  Uniform  Average 

The  common  arithmetic  mean  of  a  climate  variable  over  the  number  of  stations  is  a  weighted 
average  with  uniform  weights.  In  the  present  study,  this  is  represented  by 

T  =  t  y>  T,  (1) 

i=\ 

where  A/"  is  the  number  of  stations,  w,  is  the  weight  and  Ti  is  the  observation  at  the  /  *  station. 
The  values  of  the  weights  are  uniform  for  all  stations  and  are  equal  to  \IN.  If  all  stations  report 
data  for  the  month,  then  N=3S.  In  the  case  of  missing  data,  the  averages  over  the  remaining 
stations  were  used  as  estimates.  The  uniform  average  is  optimal  when  the  climate  variables 
measured  at  the  stations  represent  spatial  white  noise.  This  means  the  stations  are  not  correlated, 
and  have  approximately  the  same  variance.  However,  when  such  conditions  do  not  exist,  as  in 
the  case  of  climate  stations,  the  uniform  average  may  be  affected  by  significant  biases. 

The  Optimal  Average 

The  optimal  average  is  also  represented  by  Equation  (1)  above,  but  with  weights  computed  using 
the  variance-covariance  structure  of  the  climate  variable  from  the  stations.  The  mathematical 
details  are  described  in  Appendix  B  and  the  cited  references  therein.  See  also  Shen  et  al,  (1994) 
and  Shen  and  Wang  (1997)  for  climatological  applications.  A  desirable  feature  of  the  optimal 
average  is  that  the  weights  are  obtained  by  minimizing  the  mean  squared  error  between  the  true 
spatial  average  and  the  estimated  average,  with  consideration  of  the  variability  (variance)  and  the 
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correlation  (covariance)  of  the  observations.  In  essence,  a  station  with  larger  variance  and  higher 
correlation  with  other  stations  is  weighted  less.  The  uncertainty  of  the  method  is  mainly  from 
the  estimates  of  the  covariance  structure.  However,  it  has  been  shovsm  that  the  estimate  of 
covariance  is  usually  more  robust  than  the  estimate  of  the  average  of  the  observation  field.  (Kim 
and  North,  1993)  Also,  the  optimal  method  allows  careful  tuning  and  testing  to  optimize  the 
estimate.  The  most  important  tuning  parameter  is  the  length  scale,  which  is  based  on  the  e-fo\d 
criterion  for  decorrelation.  This  length  scale  represents  the  distance  at  which  the  spatial 
correlation  between  pairs  of  stations  have  decreased  to  approximately  \/e  and  must  be  computed 
for  each  climate  variable  involved.  For  Alberta,  the  length  scale  for  temperature  is 
approximately  700  Km  and  that  for  precipitation  is  approximately  220  Km. 

RESULTS 

For  the  maximum  and  minimum  temperatures,  the  monthly  averages  were  first  computed  for 
each  station.  Then  the  anomalies  were  computed  as  departures  of  the  monthly  averages  from  the 
respective  reference  period  averages  for  the  station.  Both  uniform  and  optimal  averaging  were 
applied  over  the  stations  to  obtain  the  Alberta  average  for  each  month.  The  annual  values  were 
then  computed  as  the  arithmetic  means  over  the  twelve  monthly  values.  For  the  present  study, 
the  focus  is  on  the  annual  average  and  the  months  of  January  and  July  for  winter  and  summer 
conditions  respectively.   Similar  calculations  were  performed  for  the  total  precipitation. 

The  resulting  time  sequences  contained  considerable  noise.  An  eleven-year  running  mean  was 
used  to  help  reveal  the  underlying  variations  by  smoothing  the  data  sequences.  Other  kinds  of 
smoothing  techniques  could  also  be  used,  such  as  the  21 -point  binomial  filter  used  in  IPCC 
(1996),  but  the  eleven-year  running  mean  provided  the  necessary  level  of  smoothing  and  has 
certain  convenience  in  its  interpretation.  For  example,  each  value  of  the  running  mean 
approximates  closely  the  decadal  mean  centred  over  the  particular  point  in  time.  Both  the 
uniform  average  and  optimal  average  sequences  were  smoothed  by  the  running  mean  and  the 
results  are  shown  in  Figures  C  to  G,  and  Figures  4  to  7  in  Appendix  A. 
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Figure  C  shows  the  variations  of  the  annual  averages  of  monthly  maximum  temperature 
anomalies  over  Alberta.  Note  that  the  data  shown  are  anomalies  and  the  zero  line  represents  the 
average  over  the  reference  period.  Both  uniform  average  and  optimal  average  show  a  peak 
around  1940  and  1980  and  a  low  period  around  1950s  and  1960s.  The  hottest  year  for  the  study 
period  is  1986.  However,  one  observes  no  general  increase  of  annual  average  maximum 
temperature  over  the  data  period.  An  interesting  feature  is  the  increases  shown  by  the  uniform 
averages  (during  the  period  1920  to  1940  and  around  1900  and  1980)  disappear  when  averaging 
is  done  with  optimal  weights. 

A  more  significant  difference  exists  with  the  annual  averages  of  monthly  minimum  temperature 
anomalies.  Figure  D  shows  that  for  the  minimum  temperature,  the  uniform  averages  before  1930 
are  much  lower  than  the  optimal  averages.  The  result  is  that  a  much  more  significant  warming 
over  a  longer  period  is  observed  using  averages  with  uniform  weights  than  those  with  optimal 
weights.  With  optimal  averages,  an  increasing  trend  appears  to  begin  around  1920  and  continue 
through  1990s  but  is  less  significant.  In  either  case,  it  is  clear  that  the  minimum  temperature 
over  Alberta  has  increased  since  1920.  The  increase  represented  by  the  decadal  means  (centred  at 
1925  and  1991)  is  of  the  order  of  0.8  degree  C  for  the  period  1920  to  1996,  based  on  the  optimal 
averaging  method.  The  main  warming  periods  are  between  1920  and  1950,  and  1970  to  1990. 

The  annual  average  total  precipitation  (Figure  E)  has  peaks  around  1900  and  1950.  Generally, 
precipitation  data  are  much  more  variable.  The  trend  estimates  are  therefore  less  reliable  than  in 
the  temperature  case.  However,  there  appears  to  be  a  slow  upward  trend  observable  from  the 
optimal  averages,  starting  around  1920  and  lasting  till  1990s.  The  estimate  of  increase  in  terms 
of  decadal  means  is  38.1  mm  using  the  optimal  averaging  method.  Annual  average  total 
precipitation  is  now  21.3  mm  more  than  the  average  over  the  reference  period.  This  still  does 
not  exceed  the  peak  observed  around  1900.  Here  again,  the  uniform  averages  show  a  more 
significant  increase,  mainly  because  of  the  lower  estimates  in  the  early  years  of  the  data  period. 
There  were  very  few  stations  in  the  early  years  and  this  contributes  to  lower  reliability  of  the 
estimates  obtained  from  both  methods.  However,  the  optimal  averaging  method  appears  to  be 
more  robust  to  such  effects. 
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For  the  average  July  maximum  temperature  over  the  province,  there  is  no  observable  upward 
trend  over  the  data  period  from  the  optimal  averaging  method.  (See  Figure  F.)  In  fact,  there  has 
been  a  steady  decrease  in  July  maximum  temperature  since  the  1940s  when  a  peak  occurred. 
This  decrease  is  more  significant  from  the  uniform  averages,  which  register  approximately  one 
degree  higher  in  the  period  before  1940.  No  obvious  trend  is  observed  in  the  January  maximum 
temperature  (Figure  4). 

The  average  minimum  temperature  in  July  over  the  province  (Figure  G)  also  peaks  in  the  1900s 
based  on  the  optimal  averages  and  there  is  no  clear  rising  trend  unless  one  considers  the  period 
after  1950.  An  interesting  feature  in  Figure  G  is  that  the  uniform  averages  show  a  steady 
increase  of  minimum  temperature  from  the  1910s.  Since  there  is  no  obvious  trend  in  January 
minimum  temperature  (Figure  5)  consistent  with  the  period  1920  to  1990  it  appears  that  the 
aforementioned  increasing  trend  in  annual  minimum  temperature  is  not  reflected  in  the  data  of 
January  or  July.  It  would  be  useftil  to  flirther  examine  the  month  by  month  minimum 
temperature  variation  and  identify  those  months  that  have  a  significant  increase  in  minimum 
temperature. 

The  optimal  average  of  total  precipitation  for  January  (Figure  6)  does  not  show  any  observable 
trend.  The  variation  over  the  data  period  does  not  differ  much  from  the  average  over  the 
reference  period.  The  uniform  average  results  are  more  variable  and  show  an  increasing  trend 
till  the  1970s  and  decreasing  trend  since  then.  There  is  greater  agreement  between  the  two 
averaging  methods  in  the  case  of  July  total  precipitation,  although  the  optimal  averages  still 
demonstrate  less  significant  changes.  There  is  a  discernible  small  increase  in  July  precipitation 
from  about  1 920  to  1 990s  (Figure  H). 

DISCUSSION 

The  optimal  average  provides  a  more  conservative  estimate  of  climatic  trends  than  the  uniform 
average.  It  also  has  the  theoretical  basis  for  a  better  integral  measure  of  climate  variables  over  a 
region.  It  was  demonstrated  in  the  present  study  that  using  uniform  averaging  to  obtain  Alberta 
climatic  trends  results  in  inflated  estimates.  This  is  in  contrast  with  climatic  averages  over  larger 
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areas  which  have  shown  insensitivity  to  spatial  averaging  techniques  (Parker,  1994).  The  focus 
is  therefore  on  the  optimal  averages  in  the  present  description  of  Alberta  climatic  trends. 

No  attempts  were  made  to  fit  a  straight  line  to  the  data  for  trend  estimate  in  this  study,  as  this 
assumes  linearity,  which  oversimplifies  the  actual  change.  Instead,  it  was  decided  to  use  the 
difference  between  smoothed  end  point  values  of  a  period  of  general  trend  as  an  estimate  for  the 
magnitude  of  change,  and  use  smoothing  filters  to  show  the  pattern  of  variation  (IPCC,  1990, 
1996).  However,  it  would  be  useful  to  estimate  the  linear  trend  and  perform  significance  tests  in 
a  future  study  to  provide  an  alternative  estimate  and  confirm  the  results  in  probabilistic  terms. 

It  would  also  be  useful  to  examine  the  minimum  temperature  trend  from  other  months  in  future 
studies  to  identify  those  with  significant  warming.  Climate  models  estimate  greater  warming  in 
northern  latitudes  and  that  late  fall  and  winter  months  would  see  the  largest  warming  effect 
(IPCC,  1996,  p.305).  This  is  not  supported  by  at  least  the  January  temperature  observations 
shown  in  the  present  analysis.  The  question  remains  whether  this  difference  from  model 
estimates  is  a  widespread  feature  or  a  regional  exception. 

CONCLUSIONS 

Historical  variations  in  temperature  and  precipitation  in  Alberta  were  assessed  using  both  spatial 
and  temporal  smoothing  techniques.  Optimal  averages  computed  over  Alberta  were  smoothed 
using  an  eleven-year  running  mean  in  an  effort  to  reveal  long-term  trends  in  these  climate 
variables.  The  present  discussion  is  limited  to  observable  trends  since  around  1920,  in 
consideration  of  the  number  of  available  stations  and  the  length  of  period  involved.  The 
following  features  are  observed: 

•    The  annual  average  maximum  temperature  over  Alberta  has  peaks  around  1940  and  1980. 
However,  there  is  no  obvious  upward  trend  in  maximum  temperature  over  Alberta. 
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•    The  annual  average  minimum  temperature  over  Alberta  has  increased  since  about  1920.  The 
increase  is  about  0.8  degree,  based  on  estimates  using  the  optimal  averaging  method.  The 
main  warming  periods  are  between  1920  and  1950,  and  1970  to  1990. 


•  There  is  no  observable  long-term  increase  in  January  or  July,  maximum  or  minimum 
temperatures  over  Alberta  since  1920.  This  is  inconsistent  with  the  increase  in  annual 
minimum  temperature  observed  and  indicates  significant  month-to-month  variations.  It  is 
worthwhile  to  examine  the  minimum  temperature  trend  for  each  month  and  identify  the 
months  with  significant  warming. 

•  The  annual  total  precipitation  over  Alberta  shows  a  slow  upward  trend  from  1920s  to  the 
present.  The  increase  was  estimated  to  be  around  38.1  mm.  There  is  also  an  increase  in  July 
total  precipitation  over  Alberta  since  1920,  but  no  clear  trend  is  identified  in  the  January  total 
precipitation.  The  July  increase  is  approximately  15mm.  However,  because  of  the  greater 
variability  of  precipitation  data,  the  accuracy  of  these  estimates  needs  to  be  further  assessed. 

•  It  would  be  useful  to  further  analyze  these  trends  and  assess  their  significance  using 
statistical  tests. 
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APPENDIX  A:  TABLES  AND  FIGURES 

List  of  Tables: 

Table  1 :  Selected  long-term  climate  stations  in  Alberta  . 

Table  2:  Monthly  average  of  daily  maximum  temperature  (degree  C)  over  the  reference  period. 

Table  3:  Monthly  average  of  daily  minimum  temperature  (degree  C)  over  the  reference  period. 

Table  4:  Monthly  average  of  total  rainfall  (mm)  over  the  reference  period. 

Table  5:  Monthly  average  of  total  snowfall  (mm  of  water  equivalent)  over  the  reference  period. 

Table  6:  Monthly  average  total  precipitation  (mm)  over  the  reference  period. 

Table  7:  Monthly  variance  of  daily  maximum  temperature 

Table  8:  Monthly  variance  of  daily  minimum  temperature 

Table  9:  Monthly  variance  of  total  rainfall 

Table  10:  Monthly  variance  of  total  snowfall 

Table  1 1 :  Monthly  variance  of  total  precipitation 

List  of  Figures: 

Figure  1 :  A  map  of  the  selected  climate  stations. 
Figure  2:  Number  of  stations  measuring  temperature. 
Figure  3:  Number  of  stations  measuring  precipitation. 

Figure  4:  Uniform  and  optimal  averages  of  January  maximum  temperature  anomalies  smoothed 
with  the  1 1-year  running  mean. 

Figure  5:  Uniform  and  optimal  averages  of  January  minimum  temperature  anomalies  smoothed 
with  the  1 1-year  running  mean. 

Figure  6:  Uniform  and  optimal  averages  of  January  total  precipitation  anomalies  smoothed  with 
the  1 1-year  running  mean. 
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APPENDIX  B:  THE  MATHEMATICS  OF  THE  OPTIMAL  AVERAGE 


An  Assessment  of  the  Change  in  Temperature  and  Precipitation  in  Alberta  A19 


Appendix  B 


Derivation  of  the  optimal 
averaging  scheme 

In  this  appendix  we  use  the  temperature  field  as  an  example  to  illustrate  our  optimal 
method. 

B.l    Weighted  estimator  of  the  averaged  anomalies 

We  wish  to  measure  the  regional  averaged  temperature  Tr(*)  time  averaged  through  the 
interval  r  centered  at  t  over  the  region  R.  The  true  value  of  this  quantity  is 

fAt)  =  jJjr{i,t)dA  (B.1) 

where 

Tr(f,t)  =  i  /        T{r,l/)dt'  (B.2) 

is  the  T-length  averaging.  The  value  of  r  can  be  a  month,  several  months  (for  seasonal 
averages),  a  year  (for  annual  averages),  or  several  years.  In  the  above  formulas,  f  is  a 
vector  that  determines  the  position  of  the  point  in  question,  T(f ,  i)  is  the  temperature  at 
f  and  t,  dA  is  the  integration  area  element  and  A  is  the  area  of  the  region  R,  i.e.  Alberta. 

Departures  of  the  r-average  firom  the  ensemble  average  are  due  to  natural  variabiHty. 
Such  a  departure  is  called  an  anomaly: 

ATr{r,t)  =  Tr{r,t)  -  (T.(f,t))  (B.3) 

where  {Tr(f,*))  denotes  the  ensemble  average  of  Tr(f,t).  Similarly,  we  can  define  an 
anomcdy  of  the  Alberta  average  temperature: 

Afr(t)=fr{t)-{Tr(t)).  (B.4) 


By  definition 


<Af.(t))=0. 


(B.5) 


In  what  follows  we  will  deal  with  the  anomaly  field  and  its  Alberta  average.  To  keep  the 
notation  simple  we  drop  the  prefix  A. 

The  global  average  temperature  anomaly  may  be  estimated  firom  the  data  streams 
collected  from  a  given  network  of  N  stations  {fj,  j  =  1, 2, 3,  •  •  • ,  N}  by  the  estimator 

Tr(t)  =  'twjTr{tj,t)  (B.6) 

where  wj  is  the  weight  assigned  to  the  j^^  station.  We  impose  the  no-bias  constraint  on 
the  weights 

E^i  =  l.  (B.7) 
Now  we  may  form  the  mean  square  error  (MSE) 

€2  =  ((2V-T,)2)  (B.8) 

where  is  a  function  of  the  weight  vector  w  =  {wj}.  A  natural  question  is  what  is  the 
size  of  €^  and  how  does  it  vary  for  different  choices  of  the  station  network  {tj}  and  of 
the  weight  vector  w.  In  particular,  one  wish  to  know  the  best  choice  of  w,  in  the  sense 
of  minimum  e^,  for  a  given  configuration  {fj},  since  the  latter  represents  the  locations  of 
historical  stations,  like  the  38  stations  used  in  our  study. 
Expanding  the  formula  for  the  mean  square  error  leads  to 

£2     =  ((fr-Tr){fr-Tr)) 

where  we  have  introduced  the  temporally  smoothed  covariance 

p,{f',f")s(r,(f',t)T,(f",t)).  (B.9) 

All  second  moment  information  about  the  anomaly  field  averaged  over  the  interval  r  is 
contained  in  this  temporally  smoothed  covariance  function. 

B.2    Optimization  of  weights  ' 

We  minimize  the  MSE  subject  to  the  no-bias  constraint  (B.7).  This  constrained  mini- 
mization problem  can  be  solved  using  the  method  of  Lagrange  multipliers.  We  simply 
extremize 

"  N 

(B.IO) 


J[w]  =  e2[w]  -  2A 


where  A  is  a  Lagrange  multiplier  and  is  always  regarded  as  being  a  constant  in  the  com- 
putational procedures  following.  This  philosophy  is  the  same  as  the  method  of  optimal 
statistical  averaging  (e.g.,  Gandin,  1993).  Taking  the  partial  derivatives  of  J[w]  with 
respect  to  the  weights  and  the  Lagrange  multiplier  and  setting  them  individually  to  zero: 

0,  z  =  l,...,iV,  (B.ll) 

0.  (B.12) 

Inserting  the  expression  for     results  in 

"^^WjPriTi.rj)  -  A   =7  /  Pr{r,Ti)dA,  i  =  l,...,iV,  (B.13) 
Y.Wi   =   1,  (B.14) 

The  above  system  is  for  iVH- 1  unknowns,  the  N  weights  {wi}  and  the  Lagrange  multiplier 
A. 


dJ 
dwi 
d£ 
dA 


B.3    Computational  algorithm 

Attention  is  needed  for  the  different  lengths  of  the  data  streams:  some  stations  started 
at  1880  and  some  at  1944  or  later.  The  missing  data  for  for  a  given  station  are  replaced 
by  the  arithmetic  average  of  the  data  from  the  other  stations  of  the  same  month.  For 
trend  detection  this  procedure  does  not  cause  much  signal  deformation.  Between  1880 
and  1883,  there  were  at  most  2  stations  in  operation  and  for  some  months  there  were  no 
data  reports  from  either  of  the  stations.  Since  the  above  arithmetic  averaging  to  fill  up 
the  missing  data  needs  at  least  one  station,  it  is  impossible  to  assess  the  climate  condition 
with  no  data.  Thus  in  the  following  we  conduct  our  analysis  from  1884.  After  this  initial 
step  of  filling  missing  data,  all  the  38  stations  have  their  data  from  1884  to  1996. 

According  to  the  formulation  in  the  above  two  sections  (see  equations  (B.13)  and 
(B.14)  in  particular),  the  crucial  part  is  to  calculate  the  covariance  matrix  pr(fi,fj) 
and  the  average  \  fj^pr{f,ti)dA.  Since  it  is  not  possible  to  conduct  true  repeated  cli- 
mate experiment,  the  ensemble  is  replaced  by  discrete  realizations  over  the  years.  Hence, 
Pr(fi,fj)  =  (Tr(ft,t)Tr(fj,t))  is  approximated  by 

1  ^ 

Pr(fi,f,)  =  --^Tr(ti,k)Tr(rj,k)  (B.15) 

where  K,  in  the  unit  [year],  is  the  length  of  the  longest  data  stream.  In  theory,  if  is 
sufficiently  large  and  the  temperature  field  Tr(f ,  t)  is  ergotic  in  time,  the  above  should  con- 
verge to  the  true  covariance.  But,  due  to  serial  correlation  in  each  data  stream,  the  above 


expression  sometimes  can  be  far  from  convergence  and  hence  causes  some  uncertainties  of 
this  method. 

The  average  ^  Jj^pr(t,ti)dA  basically  measures  the  strength  of  the  climate  signal  of 
a  neighborhood  of  the  station  i  and  the  area  the  station  i  can  reach.  In  particular,  if  the 
anomaly  field  is  isotropic,  a  reasonable  empirical  expression  for  px(f,  fj): 


(B.16) 


prir.ti)  =C7if7oexp 

where     is  the  variance  of  ith  station  and 

J  38 
1=1 

is  the  averaged  variance,  a  is  the  correlation  length,  i.e.  the  length  scale,  of  the  field,  a 
and  P  are  empirical  coefficients  (see  Vinnikov  et  al.  (1990)).  According  to  the  results  of 
Kim  and  North  (1993)  and  Hansen  and  Lebedefi"  (1987),  the  length  scale  may  vary  from 
700  [km]  to  2000  [km]  depending  on  the  length  of  r  and  whether  the  region  jR  is  land  or 
ocean.  Our  e-fold-correlation  tests  support  that  a  =  700  km  for  temperature  and  220  km 
for  precipitation  in  Alberta.  The  area  of  Alberta  is  661,185  km^. 
We  take  the  following  approximation 

-^^Pr(r,fi)rfA«7<7i<7o~-.  (B.17) 

In  this  expression,  7  is  a  dimensionless  constant  equal  to  l/(7re).  The  empirical  values  for 
a  and  7  can  vary  from  region  to  region.  The  best  way  is  yet  to  be  found  to  tune  these 
parameters. 

Now  the  linear  equations  (B.13)  and  (B.14)  can  be  solved  to  obtain  the  values  of  the 
weights  wi,W2i  •  •  • ,  wjsf  and  the  Lagrange  multiplier  A.  With  this  result,  one  can  compute 
and  plot  the  weighted  average  of  the  climate  anomalies. 
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